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Cell polarity refers to the spatial asymmetry of molecules on the cell membrane. 
Altschuler, Angenent, Wang and Wu have proposed a stochastic model for studying 
the emergence of polarity in the presence of feedback between molecules. We ana- 
■ lyze their model further by representing it as a model of an evolving population with 

interacting individuals. Under a suitable scaling of parameters, we show that in the 
| infinite population limit we get a Fleming- Viot process. Using well-known results for 

such processes, we establish that cell polarity is exhibited by the model and also study 
its dependence on the biological parameters of the model. 
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1 Introduction 
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X 

The phenomenon of polarity is ubiquitous in living organisms. It is known to occur at many 



levels : from cellular to organismic. Polarity is what causes one part of a biological system 
to be different from another. Understanding how polarity is established and maintained is a 
matter of fundamental concern for biologists. 

In this paper we are interested in polarity at the level of individual cells. Consider a 
spherical cell consisting of the cytosol and the membrane. Suppose that it contains numerous 
molecules that may either reside in the cytosol or on the membrane. The phenomenon of cell 
polarity refers to an identifiable form of spatial asymmetry of molecules on the membrane. 
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Biologists generally consider a cell to be in a polarized state when most of the membrane 
molecules appear to be concentrated around a single site or located in a single hemisphere 
on the membrane. It is known that many types of cells exhibit this phenomenon. The most 
common example is the yeast cell (see [35, 34, 22, 30]) but there are many others (see [9, 10]). 
As noted in [9], cell polarity is vital in the creation of functionally specialized regions on the 
membrane, which can then facilitate cellular processes such as localized membrane growth, 
activation of immune response, directional cell migration and vectorial transport of molecules 
across cell layers. 

Due to its importance, many attempts have been made to investigate the mechanisms 
responsible for cell polarity. Drubin and Nelson [9] mention that the existence of cell polarity 
involves positive feedback from the signalling molecules on the membrane. This feedback 
enables the signalling molecules to perform localized recruitment, thereby causing concen- 
tration of molecules in a specified region on the membrane. Examples of such signalling 
molecules include, Cdc 42 in budding yeast (see [3]), mPar3/mPar6 in neurons (see [38]), 
Rac in kidney cells (see [18]) and human chemotaxing neutrophils (see [44]). Even though 
the feedback mechanism may bring the molecules together, it is unclear if it can generate 
cell polarity alone. This is because the molecules on the membrane are constantly diffusing 
and hence any clusters that form may disappear quickly with time. Biologists have proposed 
that additional mechanisms like directed transport and coupled inhibitors are required to 
counter the spatial diffusion and generate spatial asymmetry (see [19, 22, 31, 41, 43]). How- 
ever these additional mechanisms are not always found in cells that exhibit polarity. Hence 
the question arises whether feedback alone can cause polarization. 

Altschuler, Angenent, Wang and Wu [1] show that indeed feedback alone can generate 
cell polarity when the number of molecules is small. They prove this result via a simple 
mathematical model derived by abstracting the feedback circuits found in cells. In their 
model, the feedback mechanism is given by the following : a molecule on the membrane may 
pull a molecule from the cytosol to its location on the membrane. In a stochastic setting they 
show that their model exhibits recurring cell polarity. However the frequency of polarity is 
inversely proportional to the number of molecules in the cell. This suggests that no polarity 
can persist in the infinite population limit without any additional mechanisms to reinforce 
asymmetry. 

In this paper we will scale some parameters of the model in [1] and study the resulting 
model. The main result of our paper is that if we let the feedback strength of each mem- 
brane bound molecule to increase linearly with the population size then we do get recurring 
cell polarity in the infinite population limit. Hence under our scaling, the model suggests 
that feedback alone can generate cell polarity in the infinite population limit without any 
additional mechanisms. Our approach is to express the dynamics of cell molecules as a 
measure- valued Markov process and then prove that in the limit, the dynamics of molecules 
on the membrane can be described by a Fleming- Viot process. This process was introduced 
by Fleming and Viot [17] in 1979 and it has been very well-studied since then. An excellent 
survey of Fleming- Viot processes is given by Ethier and Kurtz [12]. Using the results already 
known for such processes we will first show that the limiting process is ergodic and hence has 
a unique stationary distribution. We will then illustrate that at stationarity the membrane 
molecules are arranged into clans of various sizes and molecules in a clan are spatially clus- 
tered. Moreover the distribution of clan sizes and the expected spatial spread of the clans 
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can be readily computed in terms of the biological parameters of the model. Our results will 
allow us to deduce that there are times when most of the molecules are part of a single clan 
and lie in a single hemisphere on the membrane, thereby causing a cell to polarize. We now 
describe the model given in [1]. 

Description 1.1 There are N molecules in the cell (cytosol and membrane). The cell itself 
is a sphere of radius R. The following four events can change the molecular configuration in 
the cell. 

• Spontaneous membrane association: A molecule in the cytosol moves to a random 
location on the membrane at rate k on . 

• Spontaneous membrane dissociation: A molecule on the membrane moves back 
into the cytosol at rate k jj. 

• Membrane association through recruitment (feedback mechanism) : A molecule 
on the membrane recruits another molecule from the cytosol at rate: 

kfb x (fraction of molecules in the cytosol) . 

At the time of recruitment, the recruited particle moves to the location of the recruiting 
particle. 

• Membrane diffusion: Each molecule on the membrane does Brownian motion with 
speed D. 

The parameters of the model N, D, R, k on , fcfb and k Q s have clear biological interpreta- 
tions. As mentioned in [1], fcfb and fc g are comparable and throughout this paper we will 
assume that 

Assumption 1.2 

kfl, > k off > 0. 

In this paper we scale up fcfb and fc Q ff by the population size N and leave fc on the same. We 
show that under this scaling the model becomes mathematically tractable as N — > oo. In 
Section 3 we will discuss the choice of this scaling and the necessity of Assumption 1.2. 

Since we will be relating this model to a well-known model in population genetics, it is 
convenient to think of cell molecules as individuals in an evolving population. Consider the 
membrane molecules as being alive and the cytosol molecules as being dead. Each membrane 
molecule has two attributes: location and clan indicator. When a membrane molecule 
recruits another molecule from the cytosol, this new molecule gets initially assigned the same 
location and clan indicator as the recruiting molecule. The location of this new molecule 
will change subsequently as it does its own Brownian motion but its clan indicator remains 
the same. We can think of membrane recruitment as a birth process in which the recruiting 
membrane molecule (the parent) passes its characteristics to the recruited molecule (the 
offspring). The membrane molecules that have the same clan indicator are said to be in the 
same clan, which implies that they have a common ancestor. When a molecule spontaneously 
associates itself to the membrane, we assign it a new clan indicator and a randomly chosen 
location on the membrane. Therefore we can think of spontaneous association as immigration 



3 



in which the individuals bring new genetic traits into the population. When a membrane 
molecule spontaneously dissociates from the membrane and goes into the cytosol, it loses 
both its attributes. So we can think of spontaneous dissociation as death. Note that a 
molecule that dies can get reincarnated. 

At any time, the membrane molecules can be classified into clans based on their ancestry. 
Since the molecules in a clan have a common ancestor, if the diffusion constant D is small, 
we can expect them to be clustered on the membrane. However the molecular diffusion may 
cause a clan to spread apart with time. Surprisingly this does not happen in our model. 
We mentioned before that in the infinite population limit, the cell dynamics is ergodic and 
reaches a stationary state at which the spatial spread of the clans does not change with time. 
This is due to the extremely fast nature of the birth and death mechanisms in our model 
which causes most of the molecules in a clan to be newly born. Hence they have been unable 
to move away from their common ancestor by too much. We will show that in the limit 
there are infinitely many clans present in the population at stationarity, but there are only 
a few large clans. These two results together imply that spatial asymmetry is present and 
persistent. We will then argue that there will be times when most of the population will be 
part of one large clan and also appear to concentrate around a single point. Consequently 
the cell is polarized at these times. This shows that unlike the original model, cell polarity 
is present in our rescaled model as the population size goes to infinity. For a detailed study 
of the model considered here, we refer the readers to [21]. 

This paper is organized as follows. In Section 2 we give the main results of our paper. In 
Section 3 we interpret these results in the context of biology and compare our results with 
the results provided in [1] for the original model. We also state some interesting research 
questions that we were unable to answer in this paper. Finally in Section 4 we provide the 
proofs of our results. 

Notation 

We now introduce some notation that will be used throughout the paper. Let (S, d) be a 
compact metric space. Then by B(S) (C(S)) we refer to the set of all bounded (continuous) 
real-valued Borel measurable functions. Since (S, d) is compact, C(S) C B(S). Both B(S) 
and C(S) are Banach spaces under the sup norm ||/|| = sup xg5 For any differentiable 

manifold M and k > 1, let C k (M) be the space of functions which are /c-times continuously 
differentiable. Let B(S) be the Borel sigma field on S. The space of all positive Borel 
measures with total measure bounded above by 1 is denoted by Aii(S) and V(S) is the 
space of all Borel probability measures. Since (S, d) is compact, Prohorov's Theorem implies 
that both V(S) and Aii(S) are compact under the topology of weak convergence. For any 



Js 

If fi G .Mi(S'), then for any positive integer m, fi m G M.i(S m ) refers to the m-fold product 
of /i. If jj, is an atomic measure of the form a n X^=i ^ f° r some a n > then is the 



fj.eMi(S) and/: S R let 
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symmetric m-fold product of \i defined by 



(m) _ 



1 



(1.1) 



n{n — 1) • • • (n — m + 1) 



l<i 1 =£i 2 ,---^i m <n 



where the sum is over all distinct m-tuples of {1, 2, ■ ■ • , n}. If m > n then the sum above is 
empty and ^ is taken to be 0. Observe that // m ) does not depend on a n and for n > m 
it is a probability measure over S m . Also note that if /x is a probability measure (that is, 
a n = 1/n) then for large n, is approximately equal to /i m . 

The space of cadlag functions (that is, right continuous functions with left limits) from 
[0, oo) to S is called Ds[0, oo) and it is endowed with the Skorohod topology (for details see 
Chapter 3, Ethier and Kurtz [11]). The space of continuous functions from [0, oo) to S is 
called Cs[0,oo) and it is endowed with the topology of uniform convergence over compact 
sets. 

For any operator A C B(S) x B(S), let T>(A) and TZ(A) designate the domain and range 
of A. The notion of the martingale problem associated to an operator A is introduced and 
developed in Chapter 4, Ethier and Kurtz [11]. In this paper, by a solution of the martingale 
problem for A we mean a measurable stochastic process X with paths in Ds[0, oo) such that 



is a martingale with respect to the filtration generated by X. For a given initial distribution 
7r G V(S), a solution X of the martingale problem for A is a solution of the martingale 
problem for (A, it) if n = PX(O)" 1 . If such a solution exists uniquely for all ir G V(S), then 
we say that the martingale problem for A is well-posed. 



Our first task in this section is to represent the dynamics of cell molecules as a measure- 
valued Markov process. Suppose there are N molecules in the cell (cytosol and membrane). 
The cell membrane will be denoted by E and it is a sphere of radius R in IR 3 . As we 
mentioned before, each membrane molecule has two attributes : location and clan indicator. 
The locations are elements in E while the clan indicators will be chosen as elements in the 
unit interval [0, 1]. Hence E x [0, 1] is the type space for the molecules. A molecule of type 
x = (y, z) G E x [0, 1] is located at y on the membrane and has z as its clan indicator. Note 
that a membrane molecule will change its type only due to Brownian motion. Therefore 
during its stay on the membrane only its location (first coordinate) changes while its clan 
indicator (second coordinate) remains fixed. 

If there are N molecules in the cell, then we assign mass 1/N to each molecule. The 
membrane population at time t can be represented by an atomic measure as follows: 



for any f eV(A), 




2 The Main Results 




(2.2) 



i=l 
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where n N (£) = N ( 1, fi N (t)\ is the number of molecules on the membrane at time t and 
Xi(t), ■ ■ ■ ,x n Nu\ are their types. Viewed as a process, fi is Markov and its state space is 
given by 

M^{E x [0,1]) = \^J2S Xi : < n <N andxi,-- - ,x n G E x [0,1] 



i=l 



For any ft G M^(E x [0,1]), the total mass (l,/i) < 1 and hence M%{E x [0,1]) C 
Aii (E x [0, 1]). If we endow J\A^ (E x [0, 1]) with the topology of weak convergence, then 
it is a compact space. 

The generator of any Markov process is an operator which captures the rate of change 
of the distribution of the process. For a detailed discussion on generators see Chapter 4 in 
Ethier and Kurtz [11]. For a speed D Brownian motion on the membrane E, the generator 
is given by where A is the Laplace-Beltrami operator on E. Note that C 2 (E) C T>(A), 
where C 2 (E) is the space of twice continuously differentiable functions on E. Next we define 
the classes of functions that we will use in this paper. 

Definition 2.1 

C = {/ G C ((E x [0, l]) m ) such that /(•, z) € C 2 (E m ) for all z G [0, l] m , 
and Vf(x, ■), Af(x, ■) G C ([0, l] m ) for all x G E m and m > 1} 

Definition 2.2 

C = {F(/i) = </> (m) ) such that / G C and m > 1} . 

We now specify the generator A^ for the process \i , which captures the dynamics of the 
cell population. For any / G C let Aif denote the action of the Laplace-Beltrami operator 
on / by considering it as a function of its i-th coordinate. Let the domain of the operator 
A^ be V(A N ) = C and for F G C of the form (/, define 

A^(/x) = f £<W m) ) (2.3) 

i=l 

+ A; off iV 2 jf ^ (V ^ - 1^ - F(//)J 
+ ^iV 2 (l - fc) jf ^ (f + 1^ - 



(i(dx), 



where /i = (1, fi) and $(•) is the surface area measure on the sphere E normalized to have total 
1. Terms in the operator above correspond to the surface diffusion of the membrane 
molecules, spontaneous association, spontaneous dissociation and membrane recruitment, in 
that order. The martingale problem for A N is well-posed and this can be seen by viewing 
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the operator A N as a bounded perturbation of the diffusion operator (given by the first 
term of A*). It is easy to argue that the martingale problem for the diffusion operator is 
well-posed and the solution for any initial distribution is just the empirical measure process 
of a system of particles doing independent speed D Brownian motion on E. Proposition 
10.2 and Theorem 10.3 in Chapter 4, Ethier and Kurtz [11] imply the well-posedness of the 
martingale problem for A N . 

It will soon become evident that the initial distribution of cell molecules is not important 
for the discussion in this paper. For definiteness we will assume that the membrane is 
initially empty. Let W G V (E x [0, 1])) be the distribution that puts all the mass at 
the measure. From now on fi N will be the unique Markovian solution to the martingale 
problem corresponding to (A^, 7f ) . 

We define another process h by 

h»{t) = (l,»»{t)) = ^,t>0. (2.4) 

At any time t, h N (t) is the fraction (or the total mass) of cell molecules that are on the 
membrane. We will refer to h N as the fraction process. Observe that h N (0) = 0. 

We are interested in showing the convergence of the sequence of processes {fJ> N } as iV — » 
oo. Note that the last two terms in A^ (see (2.3)) do not appear to converge independently. 
This is because terms like 



'Ex[0,l 

will typically be of order 1/N and we are multiplying them by N 2 outside. However conver- 
gence does happen because these two terms combine to give a second order term. Instead of 
directly dealing with the sequence of generators {A^} we will prove the convergence result 
in Section 4 by using the particle construction introduced by Donnelly and Kurtz in [8]. This 
construction provides a more probabilistic way of passing to the limit. 

Define an operator A as follows. For any F G C (M.\ (E x [0, 1])) of the form F(fi) = 
YYiLi (/ij /•*)> where fi G C fl C (E x [0, 1]) for alii = 1, 2, • • • , m and m > 1, define 

AFOi) = ^f)<A/ i) Ai>n</i.A*> (2-5) 

i=i j^i 



+ k 



on 



^1 ~ h eq ) 



y2 (fi(y,z)#(dy)dz- fi(x)n(dx))T\(fj,fj.} 

i=1 JEJ[0,l] 



The operator A is the generator of a Fleming- Viot process. The martingale problem corre- 
sponding to it is well-posed and each solution has paths in Cp(Ex[o,i])[0) 00 ) by Theorem 3.2, 
Ethier and Kurtz [12]. We are now ready to state the convergence result. Throughout this 
paper =>- will denote convergence in distribution. 
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Theorem 2.3 There exists a stopping time ( with respect to filtration generated by fi ) 
satisfying tn — > a.s. as N — > oo, such that if we define processes h N and ju as 

h N {t) = h N (t + r N ),t>0 (2.6) 

and 

$ N (t) = fi N (t + r N ),t>0 (2.7) 

then the following is true. 

(A) For any T > 0, 

where h ea = 1 — ^ 

(B) Suppose that the sequence of random variables {/^(O)} converges in distribution to 
n(0) and let ttq G V (V(E x [0, 1])) be the distribution of fi(0)/h eq . Then =^> fi in 
-Da1i(£x[o,i])[0, oo) as N — > oo ; where /i = h eq v and v is the Fleming-Viot process with 
type space Ex [0, 1], generator A and initial distribution ttq- 

Remark 2.4 Note that the state space of the processes jl N is Aii(E x [0, 1]), which is com- 
pact and so V {Aii(E x [0, 1])) is also compact by Prohorov's Theorem. Hence the distribu- 
tions of£i N (0) will certainly converge along a subsequence and the assertion of the theorem 
above will hold for this subsequence. In fact, the distributions ofj2 N (0) converge along the 
entire sequence (see Remark 4-4)- 

A heuristic explanation for the above result is as follows. As N gets larger, the extremely 
fast nature of the birth and death mechanisms forces the fraction process to immediately 
settle to an equilibrium value h cq (given by part (A) of the above theorem). Note that 
&fb^eq(l — ^cq) = Ksheq and so at this equilibrium value, the net influx of population onto the 
membrane due to birth matches the net efflux of population from the membrane due to death. 
Since the total mass on the membrane is not allowed to deviate from this equilibrium, any 
addition of new mass due to immigration must be concurrently offset by an equal reduction 
in existing mass due to death. Hence in the limit, the net demographic effect of immigration 
is the same as that of mutation and therefore we see a mutation-like term in the limiting 
generator A (see the second term in (2.5)). Similarly the addition of new mass on the 
membrane due to birth must be accompanied by the reduction of equal mass due to death. 
This gives rise to the second order sampling term in A (see the third term in (2.5)). These 
ideas are made rigorous in the proof of Theorem 2.3 given in Section 4. 

From now on v will denote the Fleming-Viot process given in the statement of Theorem 
2.3. We next claim that v has a unique stationary distribution and it is also strongly ergodic in 
the sense that its transition function converges asymptotically to the stationary distribution. 
In fact this convergence is exponentially fast. Let S be any metric space and let B(S) be 
the Borel sigma field on S. Define the total variation metric over the space of probability 
measures V(S) by 

lh-^ll var = sup \\vi(r) -« 2 (r)|| . 

reB(S) 



sup 

0<t<T 



h N (t)-h 



I'll 



as N — > oo, 
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Proposition 2.5 (A) The process v is strongly ergodic and it has a unique stationary 
distribution II G V(V(E x [0, 1])). 

(B) The transition function of v converges to the stationary distribution exponentially fast. 
There exists a constant C > such that 

\\PW) e •) - nCOlU < ^ex P (- ( koni l K heq) ) t 

Proof. Both parts follow from Theorem 5.1 and Corollary 8.4 in Ethier and Kurtz [12]. □ 

Define a V([0, l])-valued process v c by 

u c (t, A) = v(t, E x A), A G B([0, 1]) and t > 0. (2.8) 

We will refer to v c as the clan process as it will help us in the determination of clan sizes. We 
shall discuss this further in Section 3. As a consequence of Theorem 2.3 we get the following 
corollary. 

Corollary 2.6 The process v c is the Fleming- Viot process with type space [0, 1] and generator 
A c given by 



k c F^) = k on il , heq) jr [ (/iW^-/i(x)/x(dx))TT</i^) 

+ M1 " heq) E (m,ri-(fi,ri(fj,ri)U(f*>ri 



6 ^ l<*7^i^ rn 

where = f]^Li (/i, A 4 ) /i e C([0, 1]) /or i = 1, 2, • • ■ , m. 

Proof. The proof is immediate from the definition of v c and the descriptions of the generators 
A and A c . □ 

Since the molecules are constantly diffusing on the membrane, we would expect each 
clan to spread out more and more with time. However we will show that this does not 
happen in our model. We would like to measure the average spatial spread of the molecules 
that belong to the same clan. One way to measure it would be to randomly sample two 
molecules from the membrane population at any time t and compute their expected distance 
squared given that they are in the same clan. We call this quantity S p (t). For % — 1,2 let 
Xi(t) = (Yi(t) , Ciit)) G E x [0,1] be the sampled molecules. Then given v(t), Xi(t) and 
X 2 (t) are i.i.d with common distribution v(t). Therefore 

S p (t) = E(\\Y 1 (t)-Y 2 (t)\\ 2 \C 1 (t) = C 2 (t)) 

E(\\Y 1 {t)-Y 2 {t)\\ 2 l {Cl(t)= c 2m ) 
P{C 1 {t) = C 2 {t)) 

E{E{C l {t) = C 2 {t)\v{t))) 
E (Je J[o,i] llfi — 1 1 S l{ Cl =c 2 }v(t, dyi, dd)v(t, dy 2 , dc 2 ] 



E U E J [01] l {ci=C2} u(t, dy u dci)z/(t, dy 2 , dc 2 ] 
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From Proposition 2.5 we know that the process v has a unique stationary distribution IT e 
V(V(E x [0, 1])). At stationarity, S p (t) does not depend on t and can be written as 

Iv(Ex[o,i}) (/s/[o,i] l^ 1 -y2|| 2 l{ Cl = C2 }/i(^i,rfci)/i(dy2,rfc 2 )) Il(tZ^) 
S'p = -. ^ . (2.9) 

fv(Ex[o,i]) [SeI[o,i] Mc 1 =c 2 }V(dyudc 1 )fi(dy 2l dc 2 )) U(dfi) 

The theorem below gives a precise formula for S p . It will be proved in Section 4. 

Theorem 2.7 Let a = = t^—. Then 

eq fcfb ft off 

s 2D 

In the next section we connect all the results mentioned in this section and present the 
complete picture in our biological setting. 



3 The Biological Interpretation 

In this paper our main objective is to show that if we take the model for cell polarity given 
by Altschuler, Angenent, Wang and Wu [1] (see Description 1.1) and scale the parameters 
fcfl, and fc fj by the population size N, then unlike the original model we get cell polarity in 
the infinite population limit. In this section we describe how the results mentioned in the 
last section help us in making this conclusion. These results will also give us an insight into 
the influence of various biological parameters on polarity. 

The main convergence result, Theorem 2.3, shows that as N — > 00 the fraction of the 
molecules on the membrane at any time is equal to h eq and the dynamics of cell molecules is 
given by a measure-valued process /1 where fi = h eq u with v being a Fleming- Viot process. 
The process v has a unique stationary distribution and its transition function converges 
exponentially to this stationary distribution (see Proposition 2.5). 

At any time, the molecules on the membrane can be divided into clans based on their 
ancestral relationships. We now determine the distribution of the clan sizes. Let v c be the 
process given by (2.8). From Corollary 2.6 we know that it is a Fleming- Viot process with 
type space [0, 1] and generator A c . Such a Fleming- Viot process arises as a reformulation 
of the infinitely- many- neutral-alleles model due to Kimura and Crow [27] (see [11] for more 
details). By Theorem 7.2 in Ethier and Kurtz [12], at any time t, the random probability 
measure u c {t) is purely atomic. This means that u c (t) is of the form Y^oPi^xi, where pi is the 
size of the atom corresponding to the point mass at x\. At any t > and any clan indicator 
z e [0, 1], the size of the clan at time t corresponding to z is just fi(t, E x {z}). The sum of 
all the clan sizes is quite clearly h eq . If we normalize each clan size by dividing it by h eq , then 
the normalized size of the clan at time t corresponding to z is just u(t, E x {z}) = v c (t, {z}). 
In other words, the normalized clan sizes at time t are nothing but the sizes of the atoms in 
^c(t). From now on by clan size we always mean the normalized clan size. 

The assertions of Proposition 2.5 are true for v c as well. Let Aoo be the infinite simplex 
given by 

Aoo = {(^1,^2, • • • ) : S ^ l x i = 1 and < Xi < 1, % = 1, 2, • • • }. 

i=i 
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GEM(8) distribution is a distribution over the infinite simplex Aoo that depends on a pa- 
rameter 9 G [0, oo). This distribution is named after three population geneticists McCloskey, 
Engen and Griffiths (see Johnson, Kotz and Balakrishnan [24] and Pitman and Yor [33]). It 
is defined as below. 

Definition 3.1 GEM (9) Distribution: 

Let {W n : n = 1, 2, • • • } be a sequence of i.i.d Beta(l,9) random variables (that is, each Wi 
has density 9(1 - x) 9 ~ l for0<x <1). Define P 1 = W x and P n = (1 - W x ){\ -W 2 ) ■■■{!- 
W n -i)W n for n > 1. Then the sequence {P n : n — 1, 2, • ■ ■ } is said to have the GEM{9) 
distribution. 

If we define 

e = ^, (3.i) 
fcfb 

then at stationarity the sizes of the atoms in v c (t) are distributed according to the GEM(9) 
distribution. This is a direct consequence of Theorem 4.6 in Chapter 10, Ethier and Kurtz 
[11]. This result shows that at stationarity there are infinitely many clans on the membrane 
and their sizes follow the GEM(9) distribution. If we arrange these sizes in descending order 
then the resulting random infinite vector has the Poisson-Dirichlet distribution with the same 
parameter 9 (see Chapter 2 in [16]). The Poisson-Dirichlet distribution was introduced by 
Kingman [28] in 1975 and many of its properties are well-known. This characterization of 
clan sizes at stationarity makes it possible to compute the distribution and moments of the 
largest clan size, second largest clan size, third largest clan size and so on (see Griffiths [20]). 
The joint distribution of the first few largest clans can also be obtained (see Watterson [42]). 
If we sample n molecules from the membrane at stationarity from v(t), then the distributional 
properties of the clans represented by this sample can be studied via the Ewen's Sampling 
Formula (see [14]). All these results indicate that the clan sizes at stationarity are far from 
uniform and there are a few large clans and many small clans. Most of the molecules are 
contained in these few large clans. In fact if we sample n membrane molecules at stationarity 
then they would belong to roughly #logn distinct clans asymptotically (see Theorem 2.11 
in [16]). 

The quantity S p (given by 2.9) measures the average spatial spread of the clans and its 
value at stationarity is given by Theorem 2.7. At stationarity there are only a few large 
clans and if S p is small relative to the cell size then the spatial spread of these large clans is 
also small. Therefore the distribution of molecules at stationarity is highly asymmetrical at 
all times. 

We now discuss the emergence of cell polarity. First we need to define it mathematically. 



Definition 3.2 (e-polarity) For any < e C 1, we say that the cell is e-polarized if at 
least (1 — e) fraction of the membrane population belongs to a single clan and also resides in 
a single hemisphere on the membrane. 

The above definition is motivated by the biological literature (see [1, 5, 1]). Note that the 
molecules in a clan will generally appear to cluster around the location of their most recent 



11 



common ancestor (see [7]). Therefore as in [1], if diffusion is small, having one predominant 
clan on the membrane is a good indication that a single site of polarity has formed. 

At stationarity, the probability that the cell is e-polarized at any time t can be expressed 

as 

p € =11 ({/3 G V(E x [0, 1]) : There exists a z G [0, 1] and a hemisphere (3.2) 
H C E such that P(H x {z}) > 1 - e}) , 

where II G V(V(E x [0, 1])) is the stationary distribution of the process v. We mentioned 
before that at stationarity, the vector of clan sizes in descending order follows the Poisson 
Dirichlet distribution with parameter 6. Let V\ be the size of the largest clan. For any e > 0, 
Theorem 2.5 in [16] implies that 

q t := P (Vi > VT^l) > 0. (3.3) 

Suppose that we are at stationarity. Let r e denote the probability that (-\/l — effraction of 
the molecules in the largest clan are situated in a single hemisphere on the membrane given 
that the size of the largest clan is at least \/l — e. Observe that S p is like a weighted average 
of the spatial spreads of the clans, where the weight of each clan is proportional to its size. 
Therefore if almost all the molecules are in the largest clan then S p is approximately the 
spatial spread of the largest clan. Hence if S p is small in comparison to the cell size, we can 
reasonably expect r t to be positive. Observe that p e > q t r e and so p e is also positive for a 
small positive e. Since the process v is ergodic, Birkhoff's ergodic theorem (see Theorem 
10.6 and Corollary 10.9 in [25]) implies that the cell will definitely reach the e-polarized state 
and in fact spend p e proportion of its time there. Thus the cell gets e-polarized infinitely 
often and we have recurring cell polarity. 

Before we proceed we need to define some new quantities. Let 

S„ = (3.4) 



X=% (3.5) 



and 

V ^cq / \kfb-k oS J 

We can interpret S p as the average spatial spread of the clans relative to the cell size while x 
can be seen as the speed of diffusion relative to the cell size. Note that the ratio (1 — h eq ) /h eq 
is the molecular mass available in the cytosol for recruitment per membrane molecule. The 
parameter 7 is just the feedback rate tempered by this availability ratio. It can be interpreted 
as the effective feedback strength. We can recast the result of Theorem 2.7 as 

Sp = ((i + lh + xY (3,7) 
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Recall that the biological parameters in our model are D, R, k on , kfb and k Q s- We now 
examine their impact on cell polarity. Instead of working with the original parameters we 
will work with 9, \ an d 7- From the above discussion it is clear that the formation of polarity 
will be facilitated if the probability q e (given by (3.3)) is high while the quantity S p is low. 
As noted earlier, the parameter 9 controls the distribution of molecules into the infinitely 
many clans present at stationarity. From the properties of Poisson Dirichlet distributions we 
know that the probability q e decreases as 9 increases and vice-versa (see [6] and Chapter 2 
in [16]). In fact it can be shown that this probability is nearly 1 if 9 ~ (see [15]). Hence 
polarity will establish more easily if 9 is small. Recall that the process v c (given by (2.8)) 
is the Fleming- Viot process corresponding to the infinitely-many-neutral-alleles model. The 
sample paths of this process take values over the space of atomic measures over [0, 1]. Using 
Dirichlet forms Schmuland [36] has shown that with probability 1 there will exist times at 
which this process will hit the state of having a single atom if and only if 9 < 1. Therefore 
9 < 1 assures that there will exist times when there is only one clan present. At these times 
the chances of observing polarity will be nearly 1 if S p is sufficiently small. The formula 
(3.7) makes it clear that the quantity S p gets smaller as the relative diffusion speed (%) goes 
down or the effective feedback strength (7) goes up. 

Recall that the likelihood of finding a cell in the e-polarized state at any time at sta- 
tionarity is given by p e (given by (3.2)). The observations made in the preceding paragraph 
show that p e increases with 7 but decreases with 9 and %■ Unfortunately we do not have a 
precise formula for p e at the moment. Such a formula would be really useful in determining 
the chances of observing polarity in a cell with any given set of parameters. It will also give 
us a clear idea of the time spent by the cell in the polarized state. 

We would now like to compare our results to the results presented in [l] for the original 
model. To avoid confusion we will denote the association, dissociation and recruitment rates 
in the original model as k° n , k° s and k^ respectively. Note that under our scaling k° n = k on 
while fc° ff = Nk f[ and k^ = Nkn,, where N is the total population size. The analysis in 
[1] assumes that > k° s and k° n is much smaller in comparison to k° s or k^. Observe 
that spontaneous membrane association tends to spatially homogenize the molecules on the 
membrane and so if k° n is not small in comparison, we cannot hope to see cell polarity. Under 
the above assumptions it is shown in [1] that the fraction of molecules on the membrane 
approaches the equilibrium value 



at an exponential rate with half-time of (heqkfy) . In our scaling, k° n = 0(1) while and 
k° s are O(N). Therefore it is not surprising that as N — > 00 the fraction of molecules on the 
membrane reaches the equilibrium value 



almost instantly (see part (A) of Theorem 2.3). Since k on is small, the bulk of the population 
at equilibrium must come through membrane recruitment. It is mentioned in [1] that if 
fcft, < k° s , the membrane will be nearly empty at equilibrium and so clusters cannot form. 
For the same reason Assumption 1.2 is required in this paper. 
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As we have discussed above, the emergence of cell polarity crucially depends on the 
likelihood of having just one large clan on the membrane. It is shown in [1] that for a finite 
population size N, the number of clans on the membrane will reduce to just 1 at certain 
times giving rise to polarity (if D is small), if the spontaneous association events are rare 
(k° n is small). However the frequency at which these times arrive is proportional tol/N and 
hence there is no recurring polarity in the infinite population limit unlike the rescaled model 
that we consider. 

It is observed in [1] that the clustering behavior for the original model is entirely deter- 
mined by a simple relationship between the ratio -fir and the population size N. Their analy- 

sis shows that if -fir <C A^ -2 then certainly one cluster will form and if -fir ^> (A^ -1 log AH 1 / 2 
then no clusters will form. Using numerical simulations they observe that the transition 
occurs when jr « AT -1 . This motivated us to scale k^ by a factor of A" and analyze the 

fb 

resulting model. We also had to scale k° s by A" because otherwise the entire population 
will soon be on the membrane (as h eq will then be 1), depleting the cytosol and preventing 
further membrane recruitment. In such a scenario, the feedback mechanism will be unable 
to counter the surface diffusion and hence there will not be any lasting cell polarity. Note 
that having = Nk^, is the same as changing the feedback rate in Description 1.1 to 
fcfb x (number of molecules in the cytosol) . This is the same as saying that each membrane 
molecule recruits each cytosol molecule at rate fcft,. Such a definition may be more natural 
for the feedback circuits found in certain cells. Our results provide an explanation for the 
existence of cell polarity in such cells if the population size is large. 

There are many biologically appealing questions about the model that we have been 
unable to answer in this paper. As we mentioned above, it would be useful be have precise 
estimates for p t . It would also be interesting to compute the time it takes to hit the e- 
polarized state and the time the cell stays polarized after that. These results would give us 
a better idea about the the onset and maintenance of polarity. The role of various model 
parameters will emerge clearly as well. 

The model we study does have the drawback of being simplistic as all the molecules 
in the cell are identical. Most cells that exhibit polarity have molecules of many different 
types recruiting each other at various type-dependent rates (see [9, 3, 40]). We would like 
to know if a multi-type generalization of our polarity model would also lead to tractable 
measure- valued dynamics in the infinite population limit. We hope to answer this question 
elsewhere in the very near future. 

In this paper we have only looked at single-site polarity. Many cells exhibit anterior- 
posterior polarity (see [37, 32, 13]). In such cells there are usually two types of molecules 
and they segregate themselves in such a way that one type of molecules form the front and 
the other type of molecules form the rear. Such an arrangement is vital for cell division 
and locomotion. It has been suggested that this phenomenon is caused when molecules 
not only recruit the molecules of their own type but also locally inhibit the recruitment 
of the other type. It may be possible to extend the model considered here to account for 
anterior-posterior polarity as well. 

The story of cell polarity is far from over and we hope that more work will be done to 
mathematically understand this biologically vital phenomenon and answer the challenging 
questions it poses. 
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4 Proofs 

In this section we prove the main results of our paper: Theorem 2.3 and Theorem 2.7. 



4.1 Proof of part (A) of Theorem 2.3 

Recall that n N (t) = N {1, p N (t)} denotes the number of molecules on the membrane at time 
t. Since p N has generator A N , we can write the generator K N for the Mo- valued process n N 
as the following. For / G C (R) let 

K N f(n) = k on (N - n) (/ (n + 1) - /(n)) + iVfc off n (/ (n - 1) - /(n)) 
+ feftnCTV - n) (/ (n+l)-/(n)). 

We start with nothing on the membrane and hence n (0) = 0. The form of the generator 
if-^ allows us to write the equation for n N as 

n N (t) =Y 1 (k m J* {N - n N (s)) ds^j - Y 2 (V ofr J\ N (s)ds^j (4.1) 

4 ,n,^ (, ""00 



+ F 3 ( A^fb ^ n JV (s) ^1 - j dsj . 

We would like to estimate the first time n N reaches a positive fraction of the population 
size N. Pick an e > such that k^(l — e) > fc Q g and define 

pf = inf {t > : n*(f) > Ne}. (4.2) 

Lemma 4.1 Let A = &/&(l — e) — k g. Then 

Moreover, — > a.s. as N — >■ oo. 

Proof. We first slow the time by a factor of iV. Let n N (t) = n N (t/N). Since satisfies 
equation (4.1), n N satisfies 

n N (t) =Y 1 (k on jf (l - ds^j - Y 2 (k oS jf n N (s)ds^j (4.3) 

+ Y 3 (k fb j\ N (s) (l-^) * 
Define 

pf = inf {t > : n N {t) > Ne} = Npf. (4.4) 
To prove the first claim of the lemma we only need to show that 

lim P < M = 1. (4.5) 
15 



For < t < p^ , " < e. Define another process Z by the equation 

Zit) =Y X (k on (l - e)t) - Y 2 (k oS j^ Z(s)ds^ + Y 3 (k^l - e) jf Z{s)d^j . (4.6) 

Note that Z is independent of N and e is chosen so that fcfb(l — e) > k Q. The form of 
the equation for Z shows that Z is a supercritical branching process with immigration. For 
< t < pf we clearly have Z(t) < n N (t) < eN . Define 

pf = inf {t > : Z(t) > Ne}. (4.7) 

It is easy to see that p^ < p^ . We will find a probabilistic upper bound on p^ which will 
show (4.5) and hence prove the first claim of the lemma. 

A supercritical branching process with immigration can be written as a superposition 
of independent supercritical branching processes starting with an initial population of 1 at 
various times. This fact along with Theorems 1 and 2 in Chapter 3, Section 7, Athreya and 
Ney [2] show that there exists a random variable W such that W > a.s. and 



Therefore 



lim e- M Z(t) = W a.s. 

t— >oo 



lim e" A ^Z(pf ) = W a.s. 

V— s-oo 



which implies that 

lim log ( e-fPzCp? )) = lim (-Apf + log Z(pf )) = log W a.s. 



Observe that Ne < Z(p^) < Ne + 1. From above we get 

p N 1 
lim - — — = — a.s. 

N^oo log N A 

Since Np^ = p^ < p^ a.s., the above limit implies (4.5) and also shows that p^ — > a.s. 
as N — > oo. This completes the proof of the lemma. □ 

Recall the definition of equilibrium fraction h eq from the statement of Theorem 2.3. Fix 

e to be ^ = \ ^1 — 7^ J and let p N be p^ for this particular choice of e. By Lemma 4.1 we 

obtain 



limP(p^< -^— )=1. (4.8) 

Recall that the process h is given by (2.4). Using equation (4.1) we can write an 
equation for h N as 

h N (t) ~Yt (Nk on £ (1 - h N (s)) da) - ±Y 2 (N 2 k oS J*h N (s)ds\ (4.9) 
+ ^Y 3 (l^k* J" h N (s) (1 - h N (s)) ds^j . 
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— N 

Let h be the process given by 



Note that 

For % = 1,2,3 let 
where 



h N (t) = h N (t + p N ) . (4.10) 



h N (0) = h N (p N ) = I^fl. (4.11) 



N 

Y i {t) = Y i (t + 6f r )-Y i {SD 

tff = iVA; on / (1 - h N (s)) ds 
Jo 

5? = N 2 k oS / h N (s)ds 
Jo 



and 



;N _ at2i 



N 2 k & / h N (s) (1 - h N (s)) ds. 



From the strong Markov property of the Poisson process we can conclude that Yi, Y2 and 



Y 3 are independent unit Poisson processes. We can write the equation for process h as 

h N (t) = h N (0) + h N (t + p N ) - h N (p N ) (4.12) 
= 7^(0) + iFi (konN^ (l-h N (s)) ds^j - ^F 2 (N 2 k oS J*h N (s)ds^J 

+ ^F 3 (iV 2 A; ft j\ N {s) (l - ^(5)) ds) . 

Let Y\ be the centered version of Yi (that is, Y°(u) = Yi(u) — u for it > 0). Define 

M N (t) =^Y\ (k on N J (l ds) - ^ ^N 2 k oS j\ N (s)ds^ (4.13) 

+ ^3 {N 2 k & j\ N {s) (l-h N {s)) ds^ , 



which is a martingale with quadratic variation given by 
1 



^y 3 (iv 2 ^ / As) f 1 - h N ( s ) ) d s 



— N 

Since < /i < 1 we have 



E([M N ] t ) < k on ^ + fc off t + kfbt. (4.15) 
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By centering the Poissons in equation (4.12) we can write 

J (l-h N (s)^ ds - Nk oS J^ h N (s)ds (4.16) 
+ Nk & J h N (s) (l - h N (s)J ds + M N (t). 



h N (t) =h N (0) + k D 



Let F{h) = k{t,h(l — h) — k Q sh, and 



Z N (t)= / k on (l-h (s))ds + M N (t). 
Jo 

From (4.15) and Corollary 2.3.3 in [23] we can conclude that {Zjy} is a sequence of semi- 
martingales that is relatively compact in Dk[0,oo). The jumps in Zn are of size 1/N and 
hence if Z is a limit point of this sequence then Z must be a continuous process a.s. Equation 
(4.16) can be written as 

if {t) =h N {0) + Z N (t) + N [ F(h N {s))ds. (4.17) 

Jo 

Define another process a N by 

a N (t) = h N (t)-h eq . 

Observe that 

F(h N (t)) = fcfb (a N (t) + h cq ) (1 - h cq - a N (t)) - k oS (a N (t) + h eq ) 
= a^t) (fcfb(l - h cq ) - fc off ) + fcfb/i eq (l - h eq ) 
- k oS h cq - kn,a N (t)(a N (t) + h cq ) 



-k^a N {t)h N [t) . ( Using h eq 



1 



off 



V ° 4 k 



fb 



From equation (4.17) we get 

r-t 

a N (t) = a N (0) - Nka, / h N (s)a N (s)ds + Z N (t), 

Jo 

which can be written in differential form as 

da N (t) + NkJi N {t)a N {t)dt = dZ N (t). 

Let /?Ar(t) = Nkfb Jq h N (s)ds. Then 

d(a N (t)e Mt) ) =e Mt) dZ N (t). 

Integrating from to t we get 

a N (t)e Mt) - a N (0) = f e Ms) dZ N (s). 
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Therefore 



a N (t) - a JV (0)e-^W = e"^ / e^dZ N {s) 



Let 



For < t < a N 



mi{t>0:h N (t)<^}. 



h N (t) > _ 1 
w - 4 N 



and hence for any small positive number e 



inf \p N (t + e) -Mt)\ =>- oo 



as TV — 7- oo. Fix any T > 0. By Lemma 5.2 in [26 

-i 



sup e 

0<t<TAa N 



z Ms) dZ N {s) 



Hence equation (4.18) gives 



sup 

0<t<TAa N 



0. 



\a N (t)-a N (0)e- Mt) \ =4-0. 



(4.18) 



(4.19) 



If <7 < T then from the definitions of a , a and (4.11) we get 



sup \a N (t)-a N (0)e-^\ 

0<t<TAa N 



> 



> 



a N (a N ) - a N (0)e 



3h ea h c 



> 



h 



*eq "-eq -0 N (y N ) 

4 + 2 
2 



2 

iV 



cq 

4~ ~~ N 



This calculation shows that 



P(a N <T)<P\ sup \a N (t) - a N (0)e~ Mt) \ 

, 0<t<TA? iv 



> 



'cq 



N 



Therefore from (4.19), P (a N < T) — > for any T > and hence cr w — >■ oo in probability. 
Let the stopping time tn be given by 



T N = p 



N 



hgN 
N 



(4.20) 



From Lemma 4.1 p — > a.s. and hence — > a.s. Let the process h be defined by 



h N (t) = h N (t + T N ) = h" [t + 



■N 



logiV 
N 



,t > 0. 



(4.21) 
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Note that 



sup 

0<t<((TAa N )-^-) 



h N (t) - h 



cq 



= sup 

0<t<((TAa N )- 1 ^-) 

< sup 

0<t<((TAW N )- 1 -^-) 

+ sup 

0<t<({TAa N )- 1 -^) 



a N \t 



a N \t + 



logiV 
N 

logiV 
N 



The first term converges to in probability due to (4.19). Observe that for < t < 
((TA^)-M) 



PN\t + 



logiV 
N 



t+ 



log JV 

h (s)ds = fcfb 



^eq 1 

4 N 



log N. 



Thus the second term above converges to as N — > oo a.s. Since log N/N — > and cr^ — > oo 
we get 



sup 

0<t<T 



h N (t)-h 



0. 



This proves part (A) of Theorem 2.3. 



4.2 Proof of part (B) of Theorem 2.3 

To prove part (B) of the theorem, we will use the particle construction introduced by Donnelly 
and Kurtz in [8]. In this construction the molecules are arranged in levels which are indexed 
by positive integers. The arrangement is such that for any positive integer n, the process 
determined by the first n levels is embedded in the process determined by the first (n + 1) 
levels. This allows us to pass to the projective limit. Another advantage of this construction is 
that it makes the ancestral relationships between molecules explicit. For any set of molecules 
we can trace back their genealogical tree to obtain results about the measure- valued process. 

We first motivate the particle construction. Suppose the total population size is N and at 
any time t there are n N (t) molecules on the membrane. The process n N follows equation (4.1) 
and suppose its evolution is known. Each molecule has a type in E x [0, 1] as before. We can 

represent the population on the membrane at time t by a vector \ Yf*(f), Y^(t), ■ • • , . 
Since the labeling of the molecules is arbitrary it contains exactly the same information as 
the measure Z(t) = ^Li^y^t)- We can choose any labeling we find convenient. So we 
look into the future and order the individuals according to the time of survival of their 
lines of descent. In this new ordering we arrange the molecules into levels, which are taken 
to be positive integers. At any time t, if there are n N (t) molecules, we will represent the 

population as the vector (x^(t), X^(t), ■ ■ • , X^ N ^\. We will refer to Xf as the z-th level 
process where Xf{t) G E X [0, 1] is the molecule type at level i at time t. Molecules are 
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allowed to change levels with time. If a death happens at time t then n N (t) = n N (t—) — 1 
and we just remove the molecule at the highest index n N (t). If an immigration happens at 
time t then n N (t) = n N (t—) + 1 and we uniformly select a level from the first n N (t—) + 1 
levels and insert the immigrant molecule there. If a birth event happens at time t then 
n N (t) = n N (t—) + 1 and we do the following. We first uniformly select two levels i and j 
from the first n N (t—) + 1 levels. Suppose i is the smaller of the two levels. Then we shall 
refer to the molecule X^{t—) as the parent and insert a copy of it at level j. The molecules 
{Xj?(t— ) : k — j, j + 1, • ■ ■ } are shifted up by one level. So at time t, the offspring molecule 
Xf (t) is a copy of Xf (i-) while Xj?(t) = Xj?(t-) for k < j and Xg(t) = X^t-) for 
k > j. In between all these events molecules are doing speed D Brownian motion on E and 
changing their location. 

What we have described above is a Markov process X N with state space 

N 



S N = (J (E x [0, 1])" . 



n=0 



We adopt the convention that (E x [0, 1])° = {A}. For x G S N , if x G (E x [0, 1])" then let 
\x\ = n for any n — 0, 1, • • • , N. If at time t, X (t) = x G S and |x| = n, then it means 
that there are n molecules on the membrane with the type vector x = (x\,X2, ■ ■ ■ ,x n ) G 
(Ex[0,l]) n . 

If \x\ = n > m and x = (x±,X2,-- - ,x n ) then let x' m = (xi,X2,-" ,%m)- Any / G 
B ((E x [0, l]) m ) can be regarded as a function over S N by defining it as f(x) = if \x\ < m 
and f(x) = f(x^ m ) if |x| > m. We now specify the generator A N of the Markov process X N 
by its action on functions in its domain T>(A N ) = C (see Definition 2.1) as 



D 



AN f( x ) =J E A ^( x ) + nNk oS (f(d n (x)) - f(x)) (4.22) 
i=i 

(at- \ n+X „ „i 

-^T)IlJJ Wi(x,(y,r)))-f(x))#(dy)dr 

+2k^(^f\ Wa{x))-f(x)) 

l<i<j<(n+l) 

where n = |x| and if x = (xi, x 2 , ■ • ■ , x n ) then di(x) = (xi, x 2 , ■ ■ ■ , • • • , x n ) (remove 

the i-th coordinate), 9ij{x) = (xi, ■ ■ ■ ,Xj_i,Xi,Xj, ■ ■ ■ ,x n ) (insert a copy of x^ at the j-th 
place) and 9i(x, (y,r)) = (xi, ■ ■ • , Xj-i, (y, r),Xi,--- , x n ) (insert (y, r) at the z-th place). 

Viewing the operator A N as a bounded perturbation of the diffusion operator (given by 
the first term on the right of (4.22)), we can argue that the martingale problem for A N is 
well-posed in the same way we argued for in Section 2 (for details see Theorem 3.17 in 
[21]). We now relate any solution of the martingale problem for A N to a solution of the 
martingale problem for A N (see (2.3)) by using the Markov Mapping Theorem (see Theorem 
2.7 in Kurtz [29]). Let 



qN 
D 



M^(Ex [0,1]) = j^XX :0<n<iVand x u ■ ■ ■ , x n G E x [0,1] 
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and 

N 

S N =\J(Ex[0,l]T 

n=0 

as before. Define 7 : S N — > Sq by 

1 - 

j{x) = — E S *i if x = ( x u x 2, • • • , x n ). 

i=l 

Define the transition function a : Sq — > V (S N ) by 

\ i=l / <r£S„ 

where S n is the set of all permutations on {1, 2, • • • ,n}. 

Lemma 4.2 Lei 7Tg £ V(Sq) and define ir N = J sN a(y , -)~Kq (dy) . If v N is the solution of 
the martingale problem for (A n ,7Tq) and X N is the solution of the martingale problem for 
(A N ,7r N ), then 7(X Ar ) and v N have the same distribution in D m n(ex[o,i])[^i °°)- Further- 
more, for any t > the distribution of X N [t] = (X^(t), X^(t), • • •) is exchangeable. 

Remark 4.3 The length of the vector X N (t) is n N (t), which is a random variable. When 
we say that the distribution of X N (t) = (X^(t), X^it), ■ ■ ■ ) is exchangeable we mean that 
given n N (t) = n, the distribution of 
(X?(t),X?(t),--- ,X*(t)) is exchangeable. 

Proof. The definition of a ensures that 7 _1 (/i)) = 1 for all /x £ S$ . If / £ C D 
C {{E x [0, l]) m ) and fi = ± £™ =1 then let 

= / f(z)a(fi,dz) = - V f(x a{1) ,--- ,x a{n) ) = (f,^ m) ). (4.23) 
JsN nl £Z 

Hence F £ C = Z~>(A). Now we show that for such a function F 

A N F(-)= [ A N f(z)a(-,dz). (4.24) 
Js N 

On writing down the expressions for and A N using (2.3) and (4.22), we observe that there 
are four terms on each side of (4.24). We will show that the equality holds term by term. 
It is easy to see that the first term corresponding to the Brownian diffusion of membrane 
molecules is equal on both sides. We check the equality for the next three terms below. 
For x = (xi, ■•• ,x n ) let a(x) = (x a rt) , x a{ 2) , • ■ • ,x a(jl) ) for any a £ S n . Let /1 = 

iET=i<W Then 

F (v+^(y,r) \ = ( 7l ^ 1 )| E f(<7(9n+l(x,(y,r)))) 

n+1 , 

=— r E - E / ( w*). fo. r ))) • ( 4 - 25 ) 

n + 1 ^— ' n! ^— ' 

i=l crgS n 
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Similarly 



I ' 1 ] V N ' i=l (n + 1)! . G E {rl+1) 

1 ™ 



2 



I E ^fE^(M^))). ( 4 -26) 



n , 

l<i<j<(n+l) o-eS 



where n + = in + 1) in the first equation above. Finally 

N f F L - h x ) n(dx) = E / ^ 



in 



^/H^))). (4-27) 



-n 

n. 
o-es 



Equations (4.25), (4.26) and (4.27) show that the relation (4.24) holds and so the Markov 
Mapping Theorem is applicable. Therefore we can conclude that j(X N ) and v N have the 
same distribution in i^Al w (-Ex[o,i])[0, oo). From Corollary 3.5 in Kurtz [29] we obtain that if 
n N (t) = n then 

E(f(X»{t),X»{t),---,XZ{t))\?t)= [ f(z)a( 1 (X N (t)) J dz) J 

Js N 

where {J^t} is the filtration generated by the process 7(X JV (-)). Since a is symmetric, the 
distribution of \X^(t), X^it), • • ■) is exchangeable. □ 

Recall from Section 2 that Wq G V {M.^ (Ex [0, 1])) is the distribution that puts all the 
mass at the measure and fi N (given by (2.2)) is the unique solution to the martingale 
problem corresponding to (A^, 7fo). For tn given by (4.20), define the process p, N by 

/2 N (t) =f i N (t + T N ),t > 0. (4.28) 

Also let 

n N (t) = Nh N (t) =n N (t + r N ),t> 0. (4.29) 

Let 7?^ e V(M%(E x [0, 1])) be the distribution of (jl n (t n ) = j2 N (0) and define vr^ e V(S N ) 
by tt n = j sN a(y, - )ttq (dy). Let X N be the unique solution to the martingale problem for 

[A N , tt n ). Note that X N lives in the space S N = {j" =0 (E x [0,1])" and for any t > 0, 
\X (t)\ = n N (t). The process h converges to h eq uniformly over compact time intervals 
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(from part (A) of Theorem 2.3). Hence n N converges to oo uniformly over compact time 
intervals as well. 

For part (B) of Theorem 2.3 we assume that the sequence of random variables {fi N (0)} 
converges in distribution to fi(0) as N — >■ oo. Let tt G V (J\4\(E x [0, 1])) be the distribution 
of (i(0) and n G V (V(E x [0, 1])) be the distribution of fi(0)/h eq . Our assumption implies 
that 7r^ converges weakly to 7Tq. Due to part (A) of Theorem 2.3, this is equivalent to saying 
that the distributions of fi N (r/v) j h N (r/v) converge weakly to 7r . 

Now sample a probability measure /i from ir and let (Yi, Y%, ■ ■ ■ ) be an infinite sequence 
of exchangeable random variables with de Finetti measure \i. Let n G V((E x [0, 1])°°) be 
the corresponding distribution of (Y 1; Y 2 , ■ ■ ■ ). Since 7?^ =>■ n Q we also have n N =>- ir. 

From now on consider X N as a process over (E x [0, 1])°° in which the components greater 
than iV do not vary. The space (E x [0, 1])°° is given the usual product topology. 

We can regard any function / G C fl B((E x [0, l]) m ) as a function over (E x [0, 1])°° by 
defining it as f(x) = /(x' m ) = /(xi, • • • ,x m ) for any x G (E x [0, 1])°°. By the definition of 
X N , for any / G C n B((E x [0, l]) m ) 

M2j(t) = f(X N (t)) - f A N f(X N (s))d Sl (4.30) 

Jo 

is a martingale. Define another process 

n»(t) 

Z N (t) = j(X N (t)) = - Sx»(t),t > 0. (4.31) 

i=i 

From Lemma 4.2, the process has the same distribution as the process fi, N . Hence if 
F G C is given by F(/i) = (/, then 

A/f = F(Z N (t)) - f A N F(Z N (s))d Sj (4.32) 
Jo 

is also a martingale. If 1X^(^)1 = n N (t) > m then 

n - 

A N f(X N (t)) = ^"£ A tf( x ) 
i=i 

+ fcon ^ X j\m(X N (t), (y, r))) - f(x))i>(dy)dr. 

The deai/i term drops out because only the molecule at the highest level is allowed to die and 
/ depends on only the first m levels. From above it can be easily seen that for any positive 
integer m and / G C fl -B((-E x [0, l]) m ), the supremum of the process A N f(X N '(•)) over 
compact time intervals stays bounded asiV-^ oo. Using this fact along with (4.30), (4.32) 
and (4.24) it is easy to argue that the sequence of processes {(X N , Z^)} is relatively compact 
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in D(Ex[o,i])°°xMi(Ex[o,i\)[Q] 00 ) ( see Corollary 9.3 and Theorem 9.4 in Ethier and Kurtz [11]). 
Suppose (X, Z) is any limit point and (X N , Zjf) => (X, Z) along the subsequence /cat. By the 
continuous mapping theorem and the boundedness of / £ CC\B((E x [0, l]) m ), the sequence 
of martingales {M^At)} converges along the subsequence to 

M XJ (t) = f(X(t)) - [ A m f(X(s))ds, (4.33) 

Jo 

which is a martingale with respect to the filtration generated by X. The operator A m is 
given by 

A m f(x) = -J2^f^) + ^[-^) £ */(%(•'•)) - /!•'•)) (4.34) 

i=l ^ eq / Ki< 7 -<m 



l<i<j'<m 



■ X) / / (/(^»(y.0))-/(a:)Wy)dr 

for any / £ T> (A m ) = C R B((E x [0,l]) m ). The operator A m is the generator for the 
process determined by the first m levels of the limiting process X. We can easily check 
that the martingale problem for A m is well-posed due to the same reasons that were given 
for A N . Taking V(A) = U~ =1 £> (A m ) and defining Af = A m f if / £ V(A m ), we see that 
the martingale problem for A is well-posed. The distribution of X N (0) (denoted by ir ) 
converges to n. From (4.33) we know that for any positive integer m, the process followed by 
the first m levels of X has generator A m . Hence X is the unique solution to the martingale 
problem corresponding to (A,tt). 
Let 

7tv = inf{t > : n N (t) = 0} 

and for any < t < 7/v define 

n N (t) 
v > k=i 

Observe that 

The process /i^ converges to the constant process h eq . Therefore 7^ — > 00 in probability 
and for any £ > 0, n N (t) — > 00 in probability. Define Z to be the process 

Z(t) = ^t>0. 

'•'eq 

Then Z(0) has distribution 7r and since (X N ,Z N ) (X,Z) along the subsequence /cat, we 
must have that (1^, Z^) =>■ (X, Z) along the same subsequence. Notice that for any t, 
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^N(t) Z(t) implies that Zjy (i) =>• Z m (t). From the exchangeability of X N (t) we get that 
for any / G C n B((E x [0, l]) m ) 

E {f{X» (t), ■ ■ ■ ,X%(t))) = E ((/, Zfr\t))) = E ((/, Z^{t))) 

for any < t < 7^. Passing to the limit along the subsequence k N we obtain 

E{f{X x {t),--- ,X m {t)))=E{{f,Z m {t))) 

for any t > 0. It shows that conditional on Z(t) , Xi(t), X 2 (t), ■ ■ ■ are i.i.d. random variables 
with distribution Z(t). Hence for any t > 0, is exchangeable with de Finetti measure 
Z(t). Therefore 



Z{t) = lim - J2s Xi{t) , a.s. (4.36) 

i=l 

and if {Ff} is the filtration generated by the process Z then 

E(/(X 1 (t),---,X m (t))|J-f) = (/,Z m (t)). 
Conditioning (4.33) with respect to {Tf} we obtain that 

(f,Z m (t))- f (A m f,Z m (s))ds 
Jo 

is a martingale. If we define an operator A by 

AF(/i) = (A m /,/i m ) 

for F(n) = (/, /i m ), then this definition agrees with the definition of the Fleming- Viot gen- 
erator A given in Section 2 by (2.5). The martingale problem for A is well-posed and hence 
Z is the unique solution to the martingale problem for (A, ttq). 

From the discussion above it is clear that (X N , Z^) =>■ (X, Z) where Z = h cq Z and Z is 
a Fleming- Viot process with generator A. Since the process fi N has the same distribution 
as the process Z N , we also have fi N =^ h eq Z in I?Mi(.ex[o,i])[0, 00). This proves part (B) of 
Theorem 2.3 with the process v being the same as the process Z. 

Remark 4.4 The approach of particle construction that we used to prove part (B) of The- 
orem 2.3 can also be used to show that the distributions of Ji N {0) = /i (tjv) converge 
along the entire sequence. Since — > a.s. we first do a random time change , ~f N 
(which is a bijection from [0, 00) to [0, 00) depending on the population size n N ) such that 
•y N (tn) —> p a.s. where p is a positive random variable. We then use the particle construc- 
tion X N (t) = (x^(t), X^it), • ■ ■ j similar to the one used here, except that the birth, death 

and immigration rates are altered according to the random time change. Next we show that 
the process X N converges on the random time interval [0,7^ (tjv)] as N — >■ 00. With a bit 
more work it is possible to conclude from this convergence that the distributions of p> n (tn) 
converge as well. 
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4.3 Proof of Theorem 2.7 



The membrane molecules are doing speed D Brownian motion on the sphere of radius R, 
which we call E. Suppose the sphere E is embedded in R 3 with its center at the origin. Let 
B = (Bi, B 2 , B 3 ) T be a Brownian motion on E with speed D and let W = (Wi, W 2 , W 3 ) T be 
a standard Brownian motion in M 3 . Henceforth, let (■, ■) denote the standard inner product 
in M 3 and let ||-|| denote the corresponding Euclidean norm. From Stroock [39] it follows 
that we can express B as the solution of the Ito's equation 

dB = VI)(l- dW - D^dt. (4.37) 

From above, it is immediate that for any t > 

E(Bi(t)) = Bi^e'^ for i = 1, 2, 3. (4.38) 



Lemma 4.5 Let B and B be two independent speed D Brownian motions on the sphere E. 
Then for any t > 

B (|| B(t )-5 W |r)=2^(l-W«e-»'). 

Proof. This result is a consequence of the simple calculation below. 

E (\\B(t) -B(t)\\ 2 ) 

= E ((^(t) - B^t)) 2 + (B 2 (t) - B 2 {t)f + (J3 s (t) - B 3 (t)) 2 ) 
= E (Bl{t) + B 2 2 {t) + B 3 (t) 2 + B\{t) + B 2 2 (t) + B 3 {t) 2 
-2B 1 {t)B 1 {t) - 2B 2 (t)B 2 (t) - 2B 3 (t)B 3 (t)) 

= 2R 2 - 2E(B 1 (t))E(B 1 (t)) - 2E(B 2 (t))E(B 2 (t)) - 2E(B 3 (t))E(B 3 (t)) 
= 2tf(l-<?«>e-^) (Using (4.38)). 

□ 

Recall the definition of S p given by (2.9) and the definition of the process X. We assume 
that we are at stationarity and hence we can also assume that X is defined for all t £ 
(— oo, oo). At any fixed time t the sequence X(t) = (Xi(t), X 2 (t), • • • ) is exchangeable and 
its de Finetti measure Z(t) has the same distribution as v{t). Thus the distribution of two 
molecules sampled from v(t) is the same as the distribution of the first 2 levels X\(t) and 
X 2 {t). For 2 = 1,2 let X-(t) = (Y^t) , d(t)) where Y^f) £ E and d(t) £ [0,1]. From the 
calculation in Section 2 we can write 

S p = E (||Yi(t) - y- 2 (t)|| 2 Idft) = C 2 (t)) . (4.39) 
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The process determined by the first two levels of X evolves according to the generator A 2 
given by (4.34) with m = 2. From the definition of A 2 it is clear that level 2 looks down 
to level 1 at rate 2k^a where a = (1 — h eq )/h eq . Moreover at both the levels there is an 
immigration event at rate k on a, in which a molecule with a uniformly chosen type in E x [0, 1] 
is inserted at that level. In between these lookdowns and immigrations, the molecules at 
levels 1 and 2 are diffusing on the membrane according to independent speed D Brownian 
motions. 

The quantity S p can be calculated by tracing back the history from time t. Let T\ 2 be 
the last lookdown time between the first two levels and Tj be the last immigration time at 
level i for i = 1,2. The random variables T\ 2 , T\ and r 2 are independent and exponentially 
distributed with rates 2k^a, k on a and k on a respectively. Let r be the minimum of Ti,t 2 and 
T12 and so it is an exponential random variable with rate 2(k on + k^)a. 

The molecules at levels 1 and 2 will be in the same clan provided r = r 12 . Molecules at 
levels 1 and 2 were at the same place at time t — r and have been doing independent speed 
D Brownian motions on the sphere E since then. Using Lemma 4.5 we get 

E{\\Y l {t)-Y 2 {t)f\C l {t) = C 2 {t)) 

POO 

= AR 2 (k on + k a )a / ( 1 - e~W s \ e ~ 2 ^ +k ^ as ds 
2D 

{(k on + fcfb)a + Jr) 
This proves Theorem 2.7. 
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